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(57)Abstract: 

PROBLEM TO BE SOLVED: To flexibly 
perform data access in a distributed 
environment. 

SOLUTION: A host 140 is connected 
through a network 1 30 to a storage 
container 160 and connected through a 
communication channel 1 32 to a controller 
120. An agent 110 prepares a virtual 
mapping table by mapping real storage 
position on the plural storage containers 
160 and positions inside a virtual disk 150 
and stores it in a volatile memory 111. 
This table is stored on a semipermanent 
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memory 121 of the controller as well, and the table in the memory 1 1 1 
is intermittently replaced with the table in the memory 121. By means 
of distributed virtualization, the controller can manage a large number of 
virtual disks to be used for a large number of host systems and a single 
virtual disk can be shared by a large number of host systems as well. 
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dftfcfcD. AW* (I/O) »ffora. *^Mt x 

:/;u©^x> h u \zttTZ>WMikmifi79T<< zfitun 
n3<fc5^&£nT^£d<h£l#ifc<hT-5->XxA, > 
[«*JS3] If^^.fEmcDvXxAKfc^T. X- 
>>x. > hit. X> h U <D 1 OfC^-rSMS&ttSg^T^x 
4 7it£nT^2>m&, *X H:8i>hUSf^fcl 

at-rsv'XxA. 

Kx> h g iznrr&&m&vt&itfiTi?T<( fitzn 

[tt*^5] if*^4lB«W->XxAlC*3t/>T, X- 
-^i>h«. x>HJ©10K*f-r-5#§&#Ma*7i? 
f-f ^<t$tXT^»*«^. *X htC^X>h'JtC*J^-5 

XfAttll:, x-^x>h&tf:n>bD-- ^ 
A. 

[tt^7] «#Jg6lB«S©>'X^-AI::;fcUT, 3ft 
x-v*;n*. x-^SBjI7d ^:3;^£•ffl(,:>T)i{I^-v;^ 
CIS*^8] tt*^ lfB®©>-XT-At*3i/iT. x> 

MjtM-^-ty h^trct^itjat-r^vXT-A. 



[M*S10] It#^8fBf£ro^X>A^ii^T. it 
rA. 

imxmi i] ts^i otB«©->x^Afc*3t,iT, 

05 X>h'JfJHIC{RS7 ; ^Xi7{4ecD-tr^^>hS:^C 
<t$r#IS<h-r^^X^A. 

[»12] (gSxVX? • -fe^*>h£fB1ggB 
rtCDtBte®^r{CTy tf>:?U *X M< I /0®jf££x 
-5>x>Mc»frU x-yx>h*tAtb*»i^l'«L 

10 TfBte®Fft£ft;rr-5->XxAT<&oT. 

x> h >J (Dtttg£^-r*gg$c©^&<!:. 

15 x-7;i/£fB1i:t-£;*^ U<h;5>£&£d££#®£T5 
v'Xt-A. 

[gt^Jgl 3] IS^Jll 2fB«©->X?-AIC*3^T. 
X 'J d t *#m<i:-r -5 v-X^A. 

[gfjfcJgl 4] lisjc^l 2iES©->X^A(C*5UT. 
20 f3ffi©0ftt, IBttcSBF^x-^ • yD7?T*5ut 

[IS^l 5] sf^Jgl 4fBmw->X7 i AtC*5^T, 

25 [W*3S16] 2fE««->X^A(C*3^T. 

[IS*«17] aS*JSl 2fB%©vX^A('*5^T, 
30 A. 

[«*S18] lt*JSl 2fB«<D->XT-A{C*3UT. 

x> h u ©tttltelsattiiSr^tfd t&ftWit-?* ->x 
[ff^l9] 8fB«W->X^At*3^T. 

-rs->xxA. 

x> h u <Dttffi«^Sii^Si<&^tr d h Zftrnt-?* -> 

40 [»2£JR2 1] W*^2 OfB^WvXxAI^^HT. 

[If*J12 2] IS^l 2fB«cDvX^A(C*5UT. 
x> h >J(0«§i«-fe:n«*iSr^trc<i:^#®t-r-5->X 
45 f A. 

[IS*«2 3] §f*Jll 2iEK«->X^A(C*3^T. 
x> K UW«ffi«x^-t»iSg^^trc: t^ttiy 
7rA. 

50 tlfc(gS7 ; ^Xi'±T-SlfT$ns^feT-S)^T. 
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tetSSe^LT. <KS7MX*±<D®Ws<!:t§PI#<&.5 

{RSx^X^l;:. *ff£:-r-2)i!)f^©^T^*l^-r-57.x-y 
[tt#Jg2 5] W#JJ|2 4|E«©*^tC*V^T. 

[H#J?i2 8] IS^3S2 4|3®^ffit*5ViT, 
£\ l^§*y^-^£m&X7"yy£^ir£<»: 

nmm2 9] 11*352 4fam©*^t*i^T, ktj 

CIS*3R3 1] *y h*7->7 \Z&^Tl&My : ■< 7,2 • 
~7uyy «ett£B±OGttttffi -7 tf St"- 

[lt*3S3 2] lt*3S3 lfBig^TjffilCfc^T. ISETj 

umms 3] is*Ji3 i.iE«©*ft»c*u»T. 
ft. 

[!8#Jg3 4] *y hV-ZWz&ZKT, Hc*i§££ 
* ^flx^Jffc n - H #fB1g£ nfe n > t a. - * plijgig 



^■jhc 7 ? -tx-t -s x t- -y yt . 

[»#H3 5] *y h 7 -?\Z&1,*T ISM? 4 X£ ■ 

tt S tifc n > fcf 3. - ^ Ggffl TOKfltt* 6 & -5 n > fcf » - 

T— y;PS:fB1g-r-SX-v'x>HfCt5t.iT. n>hD- 
20 7*^37>HS§it5Xr7yi> 

7"-y;KDX> h U WtC*3^T«^*7^7^^ -Jit? -5 

t— ^MC *5^TilifP^^7-r -5 X t- -7 7 £ . 
nv>h*CJSSUT. 7 i -^';i/^Mff-r^X7">7 7 , 'i:^ 
25 nfi-?2>£oiZffil&2tlT^Z>=l>¥3L — ? • -futfy 

[0 0 0 1] 

[K^m^] *tU^«. 2 0 0 0^6^ 2 BtCdd^^n 

30 fzxm&mmm 6 o / 2 0 9 , io9^wi6o/2 

0 9, 3 2 6^6©ffi5fe«<£i3I-r-5., ^(058^^ 
[0 0 0 2] 

[%B^coJSTl»ft«7>S] Am^jv-ytf>^ 
35 CcfcStfffcl^-y^ffltAfc, ig^b^-^IBtti^x 

[0 0 0 3] 

[ft*<D&ffi] i^nX^-^J. 01 (A) 

fzmmmzTKt&o ic, bsia ^^^^xtt/N'x^ji 

40 A-H-^^X^. XDy tf • t^-CX^, 7=— 

^ R^H^'f^'co.tp^-^IBtiSBft^-r 

s. z\<D±?trMm^**Mt&m7 : -?fcm.&*jmz 

fBSSE^*frS7^-feXtt. ^tf3>hfrL- 
45 [0 0 0 4] ^mcO^A-fX^IBtt^ElC^ 

I, &&v>?.-^\z&z > T : -5$mvc*°\mz-$-z>z\£ 
tf&&iitS.^-C&tt. ^-cOi^m. 01 (B) 

ntr/W X*>e>^:2)X hu-y • IU7 • ^-7 h "7— 
50 ^ (SAN) *f^fi)cLfc. SANI1 1 -P«±<D 
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• o^-xx-x (scsi) A'xsrjiDT&igrrn 

#-5. SCSI yXfATIi, — M<DWiW3--V 

(LUN) ^ffl^T^-^fBtt^^fg^-TS. MBit 
M^fftt, S'J<H©f2tft^S. Xte 1 ootBteSS© 1 K# 

(zoning) f- • > ? trnZf-Siiit 2> 

ClitCfcoT. SANlt ^^-^IBiS'J V- 

XSWTSCt^TtS. C1C0S ANSr^Lfc^-^gB 

ttu v-xc5#*m, ^©x-^R^tBis^aeffl^ 
A-tiKH'Jirrs. torn**. fBtt&swrax h£#&© 

it. M*. Km?-? • XbU-WtiLmiZ&Q. «M 

^s-fe^j-'j^^^fc^^n^. cinw:. sanbt 

5Ct^5A>f,Ta65. SANCTIS. LUNM 

• xA-f xfUHcxr-f A • ^-v^Hg 
iSfflH5:tl:<toT, iSjl^-^Atti^ (I/O) 

anh *x hi/-y • 3 >f tr»i®siao: 

M/UtLTlfL, tOfctt, IBtSWS, 

[0 0 0 5] Z.nZ<Dfflm(Dfztt>\Z. SifiK&oT. (1 
S A N Jgg lC(g«{t; L-f -V *iiSD b fco <£ffl{fc 
-f ^IIScDfBttgB(D^ia^X«h#D^tc«S 
&ffl^fg&fB1g;£F B 1£(£S5^X?Xte#gx 
— Aldtfiy-r-avx h-?l7WA- • 

If-fXi'07y'7h7^->3> (abstract! 
on) ibT-9— Aco^l^-^ >if • vX^AtCil 
^£n, <5S#Ux-A^gx^X£T&-5;^©<fc5 

hU-y-7Kl«LUNT'tt&K &\Z, <g«*U 
x-A«. fBttit^->XT-A<i:«ffl5:bT, ffr&£n. 

&3S£n mmzn. &mzn. ^lt. jUrwkstk 

<5ItE&x hl/-> ! *ta;?I-r^lzbfc^oT. t£3I£n 

-5. SAN«tt, ^l/-f-f>^^7 7h7 ; t 
-AWj£^®Hlc«a-rS7^'J^-->3 MCttbT. 
SANU V-XCD#— -J-frRlfmrnttltJ. IfflcDS AN 



[0006] b7&>b&a<e., of^w^^x^R^tatg-y- 

7->XrA^ffl^5 S ANte. 3 >tf^-^ • ->X^A 
RtffBt&gB (XhU-v) IBWJMflcKffcSnT^* 
05 *^ £ c0<?D->X^ARy:fB1g 1 l : SiSffl^*^. 
cn£RO^<£ft!i<03ffi©;ta&K:. gEffWSANSffi 
X^r-^f'J^^ HIC. SANIRS 

A-f xKir*5ttsx hi^-v • uv— xoftttftt&s. 

10 [0 0 0 7] Lt^ot, SANK&W&cn's.&tf-e 

«ffi©sa^M-r-5. esc^^tx/tx-^iBtfexx^A 
at^enc^s. ji^^tifcfBie->x5 i A<Diatc, 

tT-yvX^A^fflt^T. ^iP&tf7^1rXtKflg£x-<Dft!l 
®IB|S^*^^-r^CttC<i;oT. SANOttfigS: 

is Mfcfa±£-t±.k5<bbjt ; t><Da<;&-5. croarovx^rA 
•m. 72±xmmt. sANicD^-^wffiffl&u^ 

20 -r^>ci<i:tc«toT, w&immtfiit— ;w6^$ns 
^tten-sx hi/-> J (D{gS{btxJ!ni^.T. san±©s 

25 »$tg©glMfc^fig<h&-5„ 
[0 0 0 8] 

£^©fl!l©iBft$fig;6^#-8tT3 t 5 ^ © g «J(CI^^ 
?T, S£Jffit£nT^£<KfflfcfBi&v'X5 i A«. 0 2 
30 (A) ~@2 (O t^tvetun-f <fc5f::. *Xh. X 

R^-7^t:>^*gtg^^bT^s. tt«xcxvytr> 

35 F^SSrHIiSI-r^. ba*b&*«6, 1 d©r: >7lt-^> h 
tc^tf^-li-fclBISteSft^fcti. «/r»X^r-U >^ 

<DMmw£.-rz>. z\n\z\t. >tr i- : ^ • ^>x 

5^A. ^(DfBtS->X^A. S^iillWitflg^Wr^^: 
^X M/-^ • ^-y hV — Z'^CDZ'r—'J 

40 ^it*«ssna. 

[0009] x^r-^euT- »»<ss<bia 

tS^Xo-AtC^^Ttf^dt^-C^S., b^b^**C, 
{RS^fBttvX^A^^b, <S«fBt6^*5^Tffl^^ 

45 navy tr>^£#ftTsii#-«. *tsa7;^uXA 

ffiiL^^X^<D7Lfi7U"f (RAID) *«Sn5. 

mmm<t, iBte->XT-A©as. s^x-^ia^©^ 

50 tt<0Pn1S^^tJ. X^r-7y;^{S«fBte->XxA©il 
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[0 0 10] 

mm&mik-fztcvxD^m] z.nt><Dwm\zfoCT. 

X h U— v h V — 2 IZ $5 Qx.ffl.it b 

Xb • ^XxA*tffl^S#*<7)iKffl'r-f x^*«ffi"rs 
dit>pIIBi&D, *— ©fia^-f X#£^SfcO*Xh 

«SfPfflLa^«t5lc-r*CtfcckO, {RfflEtti/X'T- 
AOXir-^KUT"^ &^<KSIB«f>X^-A<7)=i># 

[0 0 11] 

\Z* *X M 4 ORtf^ H/-i? • n>ft 1 6 OHO 
AfflA (I/O) »ffs<&&«>K: l oBUiOfiSx* X* 

1 5 oswt^^«)Mi&, iK±<0#«"7ye> 

fif'- 7— >0P2 0 OSJH^fc, £gXM/-y • XU7 
•*7h7^ (SAN) HP«. 

2 0 011 <SS^X^ 1 5 0ftO&B£X b\s—zs • 

Tit ttTTlKBKRIIf*. 
[0 0 12] *58W«, XH/-y -XU7' *y h7 
(SAN) <OSkjaS^«i:"rs. L&tfoT, *58 
»H M/-y • h>7-^ i 3 0 

\zmm»imrc$>z>* sanwtii xn/-yo>f 

tl6 0H &*0<Ot><DT&oT, r^X^Mr-y 

XM4 0H a>t!n.-^, :/U>*«, 
^l:8iUhl/-v • l 6 o^^o^-^ic 

[0 0 13] nmz. X b S> «^7h7-^130 



(scsi) xtt^T-r/*- ^**;i^a*©*«5tt« 
05 ^fix bu-zs&mm^m-c&Q. fiis^is© 

*«ffi«<D 1 OTtt* X M/-y • *y by-? 1 3 0 
10 *T^>o L^b&a*€>, yXrMOOORIHl 

SCOX hU-v • h»7-^ 1 3 OJilC&ttStefflfc 

[0014] *xH40H i /oWiftzj-vy bzfc 

15 fflrf^T.^ 1 5 OCl^fTU ^nCMlt, T ytT> 
^ • X-yx/h 1 1 0^-y;U2 0 O^T^-feX-T 
X— vx>M 1 Ott— $i\Z\ZC<Dl:5\ZVT*7s 
M 40 izMMttW btl&W* X— 5?x>h 1 10«r 
-y;i/2 0 0£*X h 1 4 0<D7^X^6IBiB£-fr 

20 $?£L<te, *X h 1 4 O©&*teBiJfi<0X--^x 
>M10^U &*X h*tgijfi©Tye>^ ■ 5=~ 
^1/2 0 0^*T^cJ:3^-r^o *-5^tt, yXrAl 
0 0 tt v »&0*X hrt*X-v;x >M10 tsarr* 
cfc3t«*-r*Cl4:t>prfiB-X?**. ^&CD*Xhl40 

25 fflCx-yx>h 1 1 0 Cgat^i^, ^XM4 
■Ott, **J6©t— 0 0K#^*7*izX£**rr 

[0 0 15] X-yx>h HOB, ftSWi:«DRA 

Mif^^jf^tt^^u i i i \z^y tr>^ • T--y;i/2 

30 0 OSr BUTT*. X-yx>h i io<on 

#>-3 y h^*>Tfeto'&f?«Lfc»^ f«)X»yx 
>M 1 0teg#£>7=— 370U2 0 0O3t!-^^o #J 

^y\z>tf- x— >*x>h i i o^xh • ->x 
i 4 0rti:i*ain, ^-com^^fe^x h • 

35 X^i^SStt**^ T7tf>^' X->?x>hiL 

XM40H X-;xx> h 1 1 0^<Dlrt£i&#rf£> 
^(hl^oT, • X-vx>h 1 1 0 £f?ik 

40 "55tt^ ; EUl:Eltt-5Ci:l:J:oT, r-^2 0 0 
te, X-yx>h 1 1 0±T§gH7i7tXRtf^5^ 
Pltetfc*. MK, T^tf>^- ^-^1/2 0 0 *JS% 
14 * ^ U Etftf -5 CI <h l:<fcoT, X-yx>hl 10 

45 ^»ffi***B^««-r<5t^5»J<OfiJj«t)»6n4. 
^ffWfC, X-yx>M 10H ttttlC*«a^!yfcJ 

[0 0 16] SEtC yXfA 10 0H 3>hP-7l 
50 2 0 £ffiA., X-yx>M 1 0 <hte#J<lT£>£7^ V 
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«>(;:. 3>hD-5 1 2 0 

3>hD-7l20B. X;P2 0 0 

£0 **'J 1 2 1 KfcBI&U VyV><D'&\Z>b 

□ >hD-7 1 2 Od^-X^ 0 OSrffi^-r^.tP^ 

ts. ¥*^^^u i 2 i«, ^roisiets^ast/ies^ 

©bTfc«fcl^. Z.(D£.5\Z. 3>ho-7l20ll n 
> hO-7 1 2 0*i->-^^ h^r7>Xtt^f|LTfe. T- 
-:7>2 0 

[0 0 17] £©<fc5fC. V>y tT>y • 5=— ~7)V2 0 0 
©^MfBtftte. a> 1 2 0©&giJT&9. n 

<D|g^-e« v vx^A^ttwtftifi. ftewc-r^if^ 

• I-yi/h 1 1 0Wa>hD-7 1 2 OF^O-f > 

^•O^lr, £x— >?i>h 11011 3>hD-7 1 2 
0O#.£:fflSfe/8£fTl'>. Iffll-^iVM lOitt 
fT^«cto^:^c:t*W*b^. ^<Dm^ ~>X^A 1 0 0 

[0018] uT-zmwrz^^iz, a>hD-7 1 2 

ORtfx— ^x>h 1 1 0©fflSf£/B«. «£^RtfU;y- 
.->«tTr^«$n-5 0 m3\Z7F-?lRB-?y \d>if • >X 
^A©-5liS^SgT«. Hftte. si[f^*;n 3 2© 

A 1 0 0 CDSiJ©*Jgli^T*«. mmfr *)l 13 2H 
X h V—V • *y h7 — £ 1 3 0 £#T?&-5. n"7> 

tsca, ss[©ftftT-*n«fl&.«© © <&ffl v» -5 c: i 
jifi^*;n3 0te. tcp/i pwip 

fc&ffloJ&T&S. 3>hD-7 1 2 0©8&6IRtf7:7 

[0 0 19] 0 5K.tfl2 6te. 7^— X;P2 0 ow^gs 

te. 1 Z>U±<D&mT-< X£ ■ -ttf*> h 2 2 0 h 
U->> • 3>a-± 1 6 0±©tetS^R)?2 3 0 tWFelWT 
7e>^tfi>MJ210 (fr) £#tf. ffitg©^ 
2 3 0H <1* ©^ h • n >-r^- 16 0. Rtfig 



S5VX? • -f >^.yi7X2 2 OCtt^-T-SX hU-y 

• 3>ftl 6 OWgg^Sr^T'S. tetgfiSf?r2 3 0© 

fflt^sx hu-i? • by-zizztLrmw 

-V&t*nt2&&ttfr>. SCSI^'y^-yTH IE* 
05 mm 2 3 0©&*Ki. LUN^gU : ?2 3 3<tv 

• T—Zffr -I>HJ2 1 0^(Dft!l©7-f — ;PK£T 

[0 0 2 0] S 5 iC^fcfc 5 C, • 7---J)V 

10 2 0 01i, [EJBtVx^ 2 2 0©& r^X:7 • Xn«.y 
in SCl^©l>hU 2 1 O^ttSdtA'TtS. 
Ctl«^5I^T'«$>-5*t. tf>y • X;y#g 
?lft£0rJMl2U&'?ye>jf&ttU SSL 

15 ^ • t— X^ •I>h'J2 10ll f3HX h U-v> • n 
>?7" 1 6 0<Ol-Diz&tf2>m&7ciy?\z-?-y \Lyy 
•T£, Bl£-tf-rX<0Bi8<K«7 i -f X:7 -7U7?' 

-X£«-r. ^©^-^2 0 0 wafigts, T-ytf>^ 

[0 0 2 1] -75. Jffatt^EJSJPIROT— ^2 0 0 
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1 . Title of Invention 

Virtual Storage System 

2 . Claims 

1- A virtual storage system for linking a host to one or more 
storage devices over a network* the system comprising: 

an agent connected to the host, the agent having volatile memory 
for storing a first copy of a table, the table having entries to map virtual 
disk positions to locations on the storage devices; and 

a controller coupled to the agent, the controller having non-volatile 
memory for storing a second copy of the table, the controller intermittently 
causing contents of the first copy of the table to be replaced by contents of 
the second copy of the table, 

whereby during an input/output (I/O) operation* the host accesses one of 
the entries in the table etored on the agent to determine one of the storage 
device locations. - 

2. The system of claim 1, wherein the table entries further 
include an indication of whether an invalid state is activated such that the 
invalid state for a table entry becomes activated when that table entry 
contains no useable mapping information. 

3. The system of claim 2, wherein the agent does not allow the 
host to complete the I/O operations with one of the entries if the invalid 
state for that entry is activated. 

4. The system, of claim 1, wherein the table entries further 
include an indication of whether a no-write state is activated such that, the 
no- write state for one of the entries becomes activated when data cannot 
be written to the storage location contained in that entry. 

5. The system of claim 4, wherein the agent does not allow the 
host to write data to the storage location in one of the entries if the no- 
write state for that entry is activated. 

1 
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6. The system of claim 1, further comprising a communication 
channel to couple the agent and the controller. 

7. The system of claim 6, wherein the communication channel 
employs a data transfer protocol to transport messages on the 
communication channel. 

8. The system of claim 1, wherein the entries include an offset. 

9. The system of claim 8, wherein the offset includes logic unit 
number identifier. 

10. The system of claim 8, wherein the offset includes a block 
identifier. 

11. The system of claim 10, wherein the entries further includes 
a segment of virtual disk positions. 

12. A system for mapping a virtual disk segment to a storage 
location within a storage device, such that a host issues a I/O operation to 
an agent and the agent determines said storage location for input/output 
operations, said system comprising: 

a table having an entry corresponding to said storage location; 
a plurality of variables indicating states of the entry; 
an offset for the entry, wherein the offset includes a logic unit 
number identifier and a block identifier; and 
a memory to store the table. 

13. The system of claim 12, wherein the memory is volatile. 

14. The system of claim 12, wherein said storage location 
comprises a block of data within the storage device. 

15. The system of claim 14, wherein the block of data is about 1 

MB. 
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16. The system of claim 12, wherein the agent is coupled to the 

host. 

17. The system of claim 12, wherein the plurality of variables 
comprise Boolean variable. 

18. The system of claim 12, wherein the states include an invalid 

state. 

19. The system of claim 18, wherein the plurality of variables 
includes a variable for the invalid state. 

20. The system of claim 12, wherein the states include a no-write 

state. 

21. The system of claim 20, wherein the plurality of variables 
includes a variable for the no- write state. 

22. The system of claim 12, wherein the states include a zero 

state. 

23. The system of claim 12, wherein the states include an error 

state. 

24. A method for performing an operation on a virtual disk 
coupled to a host within a network, comprising: 

specifying a block on the virtual disk within the operation; 
accessing a table mapping the block to a storage location on a 
storage device; 

issuing a corresponding operation to the storage device, wherein the 
corresponding operation correlates to the operation on the virtual disk; 
completing the corresponding operation; and 
presenting the completed corresponding operation to the virtual 

disk. 

3 
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25. The method of claim 24, wherein the issuing step includes 
issuing the corresponding operation from an agent coupled to the host. 

26. The method of claim 24, further comprising updating the 
table with a persistently- stored table residing in a non-volatile memory. 

27. The method of claim 24, further comprising determining 
states of the table. 

28. The method of claim 24, further comprising sending a fault 
message when the table is unable to be accessed. 

29. The method of claim 24, further comprising storing the table 
in a volatile memory. 

30. The method of claim 24, receiving updates for the table from 
a controller. 

31. A method for maintaining a table for mapping virtual disk 
blocks to storage locations on storage devices within a network, 
comprising: 

receiving a command from a controller at an agent storing the table; 
activating states within entries of the table; 
completing operations at the table; and 
updating the table in response to the command. 

32. The method of claim 31, further comprising setting a blocking 
flag until operations are completed. 

33. The method of claim 31, further comprising obtaining 
mapping information from one of the entries in the table. 

34. A computer program product comprising a computer useable 
medium having computer readable code embodied therein for performing 
an operation on a virtual disk coupled to a host within a network, the 

4- 
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computer program product adapted when run on a computer to effect steps 
including: 

specifying a block on the virtual disk within the operation; 
accessing a table mapping the block to a storage location on a 
storage device; 

issuing a corresponding operation to the storage device, wherein the 
corresponding operation correlates to the operation on the virtual disk; 
completing the corresponding operation; and 

presenting the completed corresponding operation to the virtual 

disk. 

35. A computer program product comprising a computer useable 
medium having computer readable code embodied therein for maintaining 
a table for mapping virtual disk blocks to storage locations on storage 
devices within a network, the computer program product adapted when 
run on a computer to effect steps including: 

receiving a command from a controller at an agent storing the table; 

activating states within entries of the table; 

completing operations at the table; and 

updating the table in response to the command. 
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3 . Detailed Description of Invention 

Related Applications 

This application claims priority from U. S. Provisional Application 
Nos. 60/209,109 and 60/209,326, filed on June 2 f 2000, the disclosures of 
which are hereby incorporated by reference in full. 

Field Of The Invention 

The present invention is a virtu ah zed data storage system usin? 
distributed tables with input/output mappings. 

Bastosaand Of The Jnventjon 

A stand alone computer generally connects to data storage devices, 
such as hard disk, floppy disk, tape, and optical drives, via a fixed 
communication channel or bus, as schematically illustrated in FIG. 1A. 
While the communication channel allows higrh-speed data transfers, access 
to the storage device is limited to the stand-alone computer. 

Over time, it has become necessary for multiple devices to connect 
to a storage device so that multiple users may share data. As a result, 
developers created a storage area network (SAN) consisting of multiple, 
interconnected, proximately located devices, as schematically illustrated 
in FIG_ IB. The SAN typically includes one or more network access 
servers that administer the interaction of the devices and the operation of 
the network, including data storage devices that are accessible by the 
other devices in the SAN. The devices may be connected through Small 
Computer Systems Interface (SCSI) buses to establish parallel 
communication. channels between the devices. In SCSI systems, a unique 
Logical Unit Number (LUN) is used to designate data storage locations, 
where each location is a separate storage device or partition of a storage 
device. Each L.XJN is further divided into blocks of small, easily 
manageable data sizes. By combining LUN zoning: with port zoning: to . 

6 



- 18 - 



2004 03 26 16:40 



1$^ 2002-91 706 



implement storage sharing, the SAN can have centralized, distributed 
data storage resources. This sharing of data storage resources across the 
SAN substantially reduces overall data and storage management 
expenses, because the cost of the storage devices maybe amortized across 
multiple devices. The use of centralized, distributed data storage also 
provides valuable security features because the SAN may limit the ability 
of a device to access data in a particular zone. The performance costs of 
using consolidated data storage, configurations within the SAN are 
substantially reduced through the use of Fibre Channel connections 
between the LUNs and the other network devices to achieve high-speed 
data input and output (I/O) operations. The SAN operates, in effect, as an 
extended and shared storage bus between the host and the storage 
containers to offer, among other things, improved storage management, 
scalability, flexibility, availability, access, movement, and backup. The 
centralization of data storage, however, presents new problems, including 
issues of data sharing, storage sharing, performance optimization, storage 
on demand, and data protection. 

Because of these issues, developers have recently added a 
virtu alization layer to the SAN hierarchy. The virtualization layer refers 
to software and hardware components that divide the available storage 
spaces into virtual disks or volumes without regard to the physical layer 
or topology of the actual storage devices. Typically, virtual volumes are 
presented to the server operating system as an abstraction of the physical 
disk and are used by the server as if virtual volumes were physical disks. 
The virtual volumes are not LUNs on a storage array. Instead, the virtual 
volumes may be created, expanded, deleted, moved, and selectively 
presented, independent of the storage subsystem. Each has different 
characteristics, and therefore expanded as the available storage expands. 
The SAN virtualization presents a single pool of SAN resources and a 
standard set of SAN services to applications residing on a broad range of 
operating platforms. 
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However, SANs using conventional disks and storage subsystems 
incur substantial system and storage management expenses due to the 
tight coupling between the computer system and the storage. Because of 
these and other reasons, the existing SAN technologies also have limited 
scalability. Furthermore, a key remaining issue for SAN visualization is 
the distribution of storage resources among the various devices of the 
SAN. 

Accordingly, there exists a need for an improved data storage 
system that addresses these and other needs in the SAN. One proposed 
class of storage system uses a subsystem to further improve the 
performance of the SAN by separating control and access functions from 
other storage functions. In such a class, access functions govern the 
ability to use and manipulate the data on the SAN, and control functions 
relate to the administration of the SAN such as device monitoring, data 
protection, and storage capacity utilization. Separating control and access 
functions from other storage functions pulls the visualization function out 
of the server and onto the SAN. In addition to the virtu alization of the 
storage provided by traditional, server bound implementations, the 
visualization layer on the SAN enables the automation of important data 
movement functions, including the copying, movement, and storage of 
data through the creation and expansion of virtual volumes. 

Toward this purpose of separating control and access functions from 
other storage functions, currently proposed vlrtualized storage systems 
consolidate control and mapping functions in a centralized location such as 
in the host, in a storage controller, or in a special virtu alization component 
in the SAN, as illustrated in FIGS. 2A-2C respectively. Centralizing the 
control and mapping functions avoids problems associated with 
distributed mapping. However, storage visualization schemes that are 
centralized in one component suffer from various scaling limitations, 
including the inabilities of scaling to multiple computer systems, multiple 
storage systems, and large storage networks with adequate performance. 
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Improved scalability may be achieved through a distributed 
virtu alized storage system. However, attempts to form distributed 
virtualized storage systems through the use of known technologies, such 
as array controllers, for distributing the mapping used in the virtual 
storage use simple algorithmic distribution mechanisms that limit data 
management flexibility, e.g. Redundant Array of Independent Disk 
(RAID). Furthermore, the known technologies do not address the needs of 
a scaleable virtual storage system, including issues of storage sharing, 
data sharing, performance optimization, storage system delays, and data 
loss risks. 

SUMMARY QF THE INVENTION 

In response to these needs, the present invention provides a system 
and method for creating virtualized storage in a storage area network 
using distributed table- driven input/output mapping. The present 
invention distributes the visualization mapping in multiple parallel 
mapping agents that are separate from a controller. This configuration 
allows the performance -sensitive mapping process to be parallelized and 
distributed optimally for performance, while the control of the mapping 
can be located in the controller chosen for optimal cost, management, and 
other implementation practicalities. The mapping agents store the virtual 
mapping tables in volatile memory, substantially reducing the cost and 
complexity of implementing the mapping agents. The controller is 
responsible for persistent storage of mapping tables, thereby consolidating 
the costs and management for persistent mapping table storage in a single 
component. Distributed visualization also allows the controller to 
manage multiple virtual disks used by multiple host systems, and even 
allows a single virtual disk to be shared by multiple host systems. The 
mapping agents preferably do not interact with other mapping agents, 
thereby improving the scalability of the virtual storage system and the 
virtual storage system's tolerance of component failures. 
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DETAILED DESCRIPTION OF A PREFERRED EMBODPylErJT 

As Illustrated in EIQS.NLfc-gB and 4A-4B^the present invention 
provides a virtualized storage area network (SAN} system 100 using one 
or more distributed, mapping tables 200, as needed to form one or more 
virtual disks ISO for input/output (I/O) operations between the hosts 140 
and storage containers 160. In particular, the table 2O0 contains a 
mapping that relates a position in the virtual disk 15 O with the actual . 
location on the storage containers 160. The specific contents of the cable • 
200 are described in greater detail below. 

The present invention covers an improved storage area network 
(SAN). The invention can therefore be applied to any known storage 
network ISO. Withixi. the SAN, it should be appreciated that the storage 
containers 160 are known and may refer to any type of present or future 
known programmable digital storage medium, including but not limited to 
disk and tape drives, write able optical drives, etc. Similarly, the hosts 140 
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may be any devices, such as a computer, printer, etc. that connect to a 
network to access data from a storage container 160. 

Likewise, the storage network 130 is also intended to include any 
communication technology, either currently known or to be developed in 
the future, such as the various implementations of Small Computer 
Systems Interface (SCSI) or Fibre Channel. This distributed visualization 
is most useful in environments where a large amount of storage is 
available and connected using some sort of "storage network" 
infrastructure. In one preferred implementation, the storage network 130 
is based on Switched Fibre- Channel connected storage. However, nothing 
in the design of the system 100 precludes its use on other types of storage 
networks 130, including storage networks that are not yet invented. 

The hosts 140 issues I/O operation commands to the virtual disks 
150, and in response, mapping agents 110 access the table 200. Although 
the agents 110 are generally associated with the hosts 140, in this way, 
the agents 110 isolate the table 200 from general host 140 access. 
Preferably, each of the hosts 140 has a separate agent 110, so that each 
host has a separate mapping table 200. Alternatively, the system 100 
could be configured so that more than one host 140 connects to an agent 

110. If multiple hosts 140 connect to the same agent 110. the hosts 140 
share access to the particular table 200. 

The agent 110 stores the mapping table 200 in a volatile memory 

111, typically DRAM. As a result, if one of the agents 110 shuts down or 
loses power, that agent 110 loses its copy of the table 200. For instance if 
the mapping agent 110 is embedded in the host system 140 and takes its 
power from that host system, as would be the case for a backplane card 
that serves as the mapping agent, the host 140 may cause the mapping 
agent 110 to shut down by eliminating power to the agent 110. However, 
by storing the table 200 in volatile memory, the table 200 can be eaeily 
and rapidly accessed and modified on the agents 110^ Storing the 
mapping table 200 in volatile memory has the further advantage of 

1 I 



- 23 - 



2004 03 26 16:40 



ftffl 2002-91 706 



substantially reducing the cost and complexity of implementing the agents 
110 as mapping components. Overall, the agents 110 allow the 
performance-sensitive mapping process to be parallelized and distributed 
optimally for performance. 

The system 100 further comprises a controller 120 that, although 
separate from the agents 110, administers and distributes the mapping 
table 200 to the agents 110. Control of the mapping is centralized in the 
controller 120 for optimal cost, management, and other implementation 
practicalities. The controller 120 further stores the table 200 in a semi- 
permanent memory 121 so that the controller 120 retains the table 200 
even after a power loss. The semi-permanent memory 121 is preferably a 
magnetic disk because of its high storage capacity and fast, frequently 
writing capabilities. The controller 120 may alternatively store the table 
200 using other forms of programmable storage such as writeable optical 
media and electronically programmable memories. The controller 120 
thus continues to store the table 200 even if the controller 120 shuts down 
or loses power. 

In this way, the responsibility for persistent storage of the mapping 
tables 200 lies in the controller 120, consolidating both costs and 
complexity. The exact design of the controller 120 is not a subject of this 
disclosure. Instead, this disclosure focuses on the structure of the overall 
system and the interfaces between the mapping agent 110 and the 
controller 120. Accordingly, it should be appreciated that any controller, 
as known in the art of digital information storage, may be employed as 
needed to implement the present invention. Within this framework, each 
of the agents 110 preferably interacts with only the controller 120 and not 
with the other agents 110. As a result, the system 100 is highly scaleable 
and tolerant of component failures. 

As described below, the interactions of the controller 120 and the 
agents 110 are defined in terms of functions and return values. In one 
embodiment of the virtual mapping system 100 illustrated in FIG. 3A, this 
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communication is implemented with messages on some sort of network 
transport, such, as a communication channel 132. In another 
implementation of the system lOO depicted in FIG.^( the communication 
channel 182 is the storage network 130 itself: Any suitable technique may 
be used to translate commands, faults, and responses to network 
messages. The communication channel 130 may employ any type of 
known data rrsmsfer protocol such as TCP/IP. The particular interactions 
between the functions and activities of the controller 120 are described irn 
greater detail below. 

FIGS. -4i£f»43 schematically illustrate the contents of the table 200. 
As described above , the table 20O contains entries 2 10 (rows) that include 
a mapping between one or more virtual disk segments 220 and storage 
locations 230 on the storage containers 160. The storage locations 230 
identify the particular storage container 160 and part of the storage 
container ISO that corresponds to the virtual disk index 220. Tho form for 
the storage locations 280 must be appropriate for the storage network 
being used. In a SCSI network, each of the storage locations 230 includes 
a LUN identifier 233 and a block identifier 235, also called an offset. All of 
the other fields in a mapping table entry 210 are simple integers or binary 
state values. 

As depicted in FIG. \&A^ tKe mapping table 200 may have one entry 
210 per each "disk block" of the virtual dish: 220. While possible to build, 
this would result in huge mapping tables and highly fragmented mapping, 
both of which introduce undesirable performance degradations. In 
another embodiment, each mapping table entry 210 represents a variable 
sized group of contiguous virtual disk blocks that map to contiguous blocks 
on one of the physical storage containers 160. This configuration of the 
table 200 offers greater mapping flexibility and dense mapping structures, 
but introduces aJgorit h-rrri c complexity in managing the variable sized 
extents and greater map entry lookup costs. 

/ 3 
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In response, the table 20O of a preferred embodiment uses mapping 
table entries 210, each having a fixed size number of contiguous blocks 
Csegments") on the virtual disk 150 that map to one storage container 
160. as depicted in FIG.\3bf In this embodiment, each of the entries 210 
contains a virtual disk segment 220 instead of a virtual disk block. The 
block identifier 235 likewise identifies a corresponding segment of actual 
storage blocks. While FIG.W/iUustrates the table 200 identifying an 
entire range of values for the mapping table entry 220 and the block 
identifier 235, the table 200 could likewise identify only the beginning or 
end block where the size of the actual and virtual storage segments is 
otherwise defined. While this configuration of KTG^^for the table 200 is 
possibly not as dense as variable sized extent mapping, the configuration 
offers the simplest and highest performance map access and space 
management. Regardless of the specifics of the table 200, the table 200 
must map a virtual disk segment 220 to each physical storage block 
involved in I/O operations. 

In a preferred embodiment, the system lOO has multiple tables 200, 
each having different mappings between a virtual disk 160 and the 
storage containers 160- In this way, different hosts 140 may have 
different access to the same storage container 160. Where the table 200 
does not include one of the storage locations 230, hosts 140 using this table 
CLe., the hosts 140 connecting to the agent HO that stores this table) 
cannot access information stored at the storage location. In fact, the host 
140 will not realize that this storage location 230 exists. 

During operation, the host 140 issues an I/O operation (e.g.. read or 
write) to some block or blocks on a virtual disk 150. Bach virtual memory 
block is represented in the mapping table 200. either as an individual 
entry or as part of a virtual disk segment 220. Each block contained in the 
I/O operation is mapped to the appropriate location on the storage 
container 160. The mapping agent 110 issues a corresponding I/O 
operation issued to the storage container 160. The I/O operation results 
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are then collected and presented as a completed operation on the virtual 
disk 150. 

In addition to mapping information specifying the storage location, 
each mapping table entry 210 also contains several states. The states are 
Boolean variables that provide information on the current status of the 
virtual disk segment. These states are important because they allow the 
table 200 stored in the agent 110 to be remotely loaded and manipulated 
from the controller 120. These states and interfaces provide the ability for 
the mapping tables to be distributed and for the mapping table entries to 
be volatile. The disclosure first describes the states and, then, explains 
some of the functions for the states. The table 200 includes at least an 
invalid state 240 indicating whether any I/O operations may occur on the 
virtual disk segment 220 and its corresponding physical location 230. The 
invalid state may be activated during a first I/O operation to prevent 
further I/O operations until completion of the first I/O operation. In a 
preferred embodiment, the table 200 further includes a no- write (Nw) 
state 260 that indicates whether the data contained at the corresponding 
physical location 230 may currently be changed. The Nw state 250 allows 
for improved storage system performance because it permits data to be 
read during another I/O operation. The invalid state 240 and the Nw state 
250 function during dynamic loading of mapping table entries, dynamic 
mapping changes, volatility of mapping table entries, and data sharing 
among similar virtual disks 160. 

When activated, the invalid state 240 generally indicates that the 
mapping table entry 210 contains no useable mapping information and 
cannot support I/O operations. Any attempt to implement an I/O 
operation through this table entry 210 causes the mapping agent 110 to 
send a fault message to the controller 120. The agent 110 does not proceed 
with the I/O operation until the controller 120 returns a fault response. In 
one embodiment, the system 100 initially activates the invalid state 240 
for all entries 210 in the table 200 when the table 200 is newly created. In 
this way, the table 200 ignores any residual entries in memory from prior 
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stored tables to insure that current entries are active and reliable. 
Similarly, the invalid state 240 may be activated when entry 210 is 
"forgotten" and lost by the mapping agent 110 volatile memory. If the 
invalid state 240 is activated in the entry 210, then all other values and 
states in the entry 210 are assumed to contain no valid information and 
are ignored. 

Because the tables 200 located in the mapping agents 110 are 
volatile, any failure or restart of the mapping agents 110 causes all of the 
entries 210 to have an active invalid state 240. A sustained loss of 
communication between the controller 120 and mapping agent 110 also 
causes I/O operations to stop, either by making all mapping table entries 
revert to an active invalid state 240 or by adding additional mechanisms 
to suspend I/O operations until directed by the controller 120 to resume 
I/O operations. This configuration allows the controller 120 to continue 
coordinating other mapping agents 110 by knowing that a failed or 
unreachable mapping agent 110 has been placed into a known state, 
providing the controller 120 high availability of data access to the 
surviving mapping agents 110. 

As presented above, the Nw state 250, when activated, indicates 
that any write operations to the virtual disk segment(s) 220 represented 
by the entry 210 cause the agent 110 to send a fault message the 
controller 120. The agent 110 does not allow the host 140 to write to the 
storage locations 230 until the controller 120 returns a fault response to 
deactivate the Nw state 250 or until the system 100 otherwise takes some 
action to write to a segment despite the active Nw state 260. Unlike the 
invalid state 240, the activated Nw state 250 prevents operations from 
generating faults. Instead, the agent 110 generally allows the host 140 to 
proceed to access data at the storage location 230. Accordingly, if only the 
Nw state is activated, table entry 210 must contain a useable storage 
location 230. 
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In another embodiment, the table 200 further includes a zero (Z) 
state 260. When active, the Z state 260 indicates that the virtual disk 
segment 220 represented by the entry 210 contains all zero bytes. This 
feature allows a virtual disk 150 to be crested and appear to be initialized 
without the need to allocate or adjust any underlying non-virtual storage. 
If an entry 210 contains an active Z state 260, the agent 110 ignores the 
storage address 230. If the host 140 attempts to read information stored 
at storage address 230, the agent 110 returns only zero-filled blocks 
regardless of the actual contents of the storage address 230. On the other 
hand, any attempts to write data at the storage address 230 when Z state 
260 is active cause the agent 110 to send a fault message to the controller 
120. The agent 110 does not allow the host 140 to write to the storage 
locations 230 until the controller 120 returns a fault response that 
deactivates the Z state 260 or until the system 100 otherwise takes some 
action to write to a segment despite the active Z state 260. 

In another configuration, the mapping table 200 further includes an 
error (E) state 270. When active, the E state 270 indicates the existence of 
an error condition and provides the information necessary to instruct the 
agent to return an error without disrupting any previous state. The E 
state 270 is used where a pre-existing failure is known and such failure 
would cause any attempts at I/O access to fail. It should be noted, 
however, that the E state 270 could also be used as the means to issue an 
error status from a mapping fault. If an entry 210 contains an active E 
state 270, the agent 110 ignores the storage address 230. If the host 140 
attempts to read from or write to the storage address 230, the agent 110 
returns an error to the host 140. 

The interaction of the agent. 110 and the controller 120 is now 
described in greater detail. In one category of interactions, fault/response 
operations, the agent 110 sends a message to the controller 120 indicating 
the occurrence of a fault during an I/O operation to the table 200. 
Typically, the fault occurs as a result of an activated state (as described 
above) that prevents the execution of the I/O operation by the agent. The 
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agent 110 sends the fault message is to the controller 120. The controller 
then determines an appropriate action and commands the agent 110 
accordingly. 

In one type of fault/response operation, a map fault, the mapping 
agent 110 alerts the controller 120 that an I/O operation requested by the 
host 140 cannot be completed because the mapping table entry 210 has an 
activated state that prevents the completion of the requested I/O 
operation. For example, the mapping agent 110 produces a fault message 
to the controller 120 in response to a request for any I/O operation to a 
table entry 210 having an active invalid flag 240 or an attempt to write to 
storage address 230 having an active corresponding Nw flag 250. The 
map fault message from the agent 110 generally identifies the requested 
I/O operation, the virtual disk segment 220 involved, and the table state 
preventing the I/O operation. When the fault occurs, the agent does not 
attempt to carry out the I/O operation. Instead, the controller 120 uses 
the fault message to respond to the faulted I/O operation (e.g. load map 
entry, change map entry, delay until some other operation has completed). 
The controller 120 response informs the mapping agent 110 how to 
proceed to overcome the cause for the fault. 

The controller 120 generally instructs the agent 110 either to 
resolve the problem or to send an error message to the requesting host. 
When resolving the problem, the controller 120 sends a replacement table 
entry 210. The agent 110 then inserts the new table entry 210 in the table 
(in place of the former faulty entry) and then retries I/O operation. If the 
controller 120 cannot resolve the problem, it instructs the agent 110 to 
issue an error message to the host. To cause the agent 110 to issue an 
error message, the controller instructs the agent to activate the error state 
260 for the table entry 210 causing the fault. As described above, the 
agent 110 then issues an error message to the host 140 regardless of the 
other contents of the table entry 210. 

IS 
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Commands to the agent 110 initiated by the controller 120 comprise 
a second category of interactions, command/response operations. Among 
these commands initiated by the controller 120 include the creation of a 
new mapping table 200 with all entries set to have an active invalid flag or 
the deletion of an existing table 200. The controller 120 may obtain from 
the agent 110 the contents of one of the entries 210 or the status of the one 
of the states in this entry 210. The controller 120 can further order the 
agent 110 to set all of the contents for one of the entries 210 or the status 
of one of the states for the entry 210. It should be noted that once the 
invalid state 240, the error state 260, and the zero state 270 are active, the 
controller 120 cannot deactivate the state because, as. described above, 
initial activation of these states voids the storage address 230. To 
deactivate these states, the controller 120 must instruct the agent 110 to 
replace the existing entry 210 with an entirely new entry. To each of 
these commands, the agent 110 returns a response to the controller 120 
after completing the ordered task. 

When the controller 120 instructs the agent to either set or obtain 
information from the table 200, the system optimally allows the controller 
120 to specify multiple, contiguous map table entries 210 in a single 
command. This arrangement allows the agent 110 and the controller 120 
to interact more efficiently and with fewer instructions. 

When the controller 120 commands the agent 110 to set one or all of 
the values and states in the table entry 210, the controller 120 command 
to the agent 110 optimally includes a blocking flag. The blocking flag 
instructs the agent 110 to delay responding to the controller 120 command 
until after the completion of any I/O operations initiated before the 
command. The blocking flag applies to only that command in which it 
resides. Other concurrent commands and subsequent commands are not 
generally affected by the blocking flag of one command. In particular, the 
agent 110 immediately changes the table 200, as instructed in the 
command, but does not notify the controller 120 of this change until 
completing all previously existing I/O operations. In this way, the agent 

19 
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110 signals to the controller 120 the completion of all I/O operations using 
the former table 200 that do not reflect the changes to the table specified 
in the command. 

During a majority of the operation, the mapping agent 110 operates 
without faults. In non-fault cases, i.e. the mapping table entries 210 are 
valid and do not have any active states that prevent the requested I/O 
operation, the virtual disk I/O operates entirely through the mapping 
agent 110. Thus, all I/O operations proceed through the mapping table 
200 and directly to the physical storage containers 160 without any 
involvement of the controller 120. As a result, the controller 120 inserts 
itself into an I/O stream only when needed to perform various 
management operations and typically does not become involved in non- 
faulting cases, allowing the system 100 to have high performance and 
scalability. 

The foregoing description of the preferred embodiments of the 
invention has been presented for the purposes of illustration and 
description. It is not intended to be exhaustive or to limit the invention to 
the precise form disclosed. Many modifications and variations are possible 
in light of the above teaching. It is intended that the scope of the 
invention be limited not by this detailed description, but rather by the 
claims appended hereto. The above specification, examples and data 
provide a complete description of the manufacture and use of the 
composition of the invention. Since many embodiments of the invention 
can be made without departing from the spirit and scope of the invention, 
the invention resides in the claims hereinafter appended. 
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4 . Brief Description of Drawings 



FIGS. XA-lBjI>RIOIt AHT] are known systems for connecting a host 
to a storage device; 

FIGS- 2A-2C tPRIOIl AKT} are known virtualized storage area 
networks; 



storage area network in accordance with embodiments of the present 
invention; and 
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. 1 - Abstract 

The present invention provides a system and method for creating 
virtu ali zed storage in a storage area network using distributed table- 
driven input/output mapping. The present invention distributes the 
virtu alization mapping in multiple parallel, mapping agents that are 
separate from a controller. This allows the performance- sensitive 
mapping process to be parallelized and distributed optimally fin* 
performance, while the control of the mapping may be' located in a 
controller chosen for optimal cost, management* and other implementation 
practicalities. The mapping agents store the virtual mapping tables in . 
volatile memory, substantially reducing the cost and complexity of 
implementing the mapping agents. TTte controller is responsible for 
persistent storage of mapping tables, thereby nfmflnl^fthwg the costs and 
management for persistent mapping table storage in a xfngln component. 
Distributed virtualization also allows the controller to manage multiple 
virtual disks used by multiple host systems, and allows a single virtual 
disk to be shared by multiple host systems. The mapping agents 
preferably do not interact onry with other mapping agents, thereby 
improving the scalability of the virtual storage system and the virtual 
storage system's tolerance of component failures. 

2 . Representative Drawing 
Pig. 3 
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